Predicting Epitopes for MHC Molecules

نویسندگان

  • Xueheng Zhao
  • Shanshan Tuo
چکیده

Major Histocompatibility Complex (MHC) plays a key role in immune response by presenting antigenic peptides, which are recognizable to T-cells. Identifying MHCbinding peptides is crucial to understand pathogenesis and develop corresponding vaccines. Direct identification of MHC-binding peptides by biological assays is laborious and expensive, because of the huge size (20) of potential combinations. Current computational methods are also not satisfactory: many of them failed to capture the features of nonconserved motifs, MHC molecule polymorphism, and non-specific binding to low-affinity peptides. Large datasets from National Institute of Health (NIH) public data repository and our own research data, with machine learning methods, provide a great opportunity for addressing this problem. Thus, we aim to develop an efficient and accurate method to identify and predict MHC-binding peptides and to further differentiate various MHC subtypes by their peptide-binding specificities. By selecting ~3000 peptide motifs as features, we built classifiers with Naive Bayes and SVM-based approaches. These classifiers achieved accuracy up to 99% on four most frequent human MHC subtypes— HLA-A01, HLA-A02, HLA-B27, and HLAB08—with 10 fold cross-validation. We applied these classifiers onto experimental data—a pool of potential binders. It has been found that up to 98% peptides are classified as binders and the classifier can be used to determine the specific subtypes. We also applied another approach, iterative statistical search in feature optimization. This iterative approach showed very promising results in our preliminary test. By applying this approach in feature generation, we achieved about 85% accuracy with less than 30 features. Further exploration of this approach will optimize feature space and improve algorithm efficiency. Introduction The immune system acts as a physical barrier against pathogen infections by activating B cells and T cells. B cells and T cells are special types of white blood cells that can recognize “nonself” cells, including pathogen-infected cells, and trigger immune response. In particular, cytotoxic T cell receptors can bind to major histocompatibility complex (MHC) that are antigen-specific receptors on the “nonself” cells and alert immune system to kill these infected cells (Smith-Garvin et al., 2009). Among three classes (I, II, and III) of MHC family, class I is by far the most well characterized subgroup, which we focused on in this study. Class I MHC proteins have a special structure with four antiparallel β-strands in the center region and two α-helices on one side. The two αhelices form a groove that contains six aminoacid binding pockets and can only accommodate short peptides of 8 to 11 amino acids—short fragments of antigens, in other words epitopes(Figure 1; Mester et al., 2011). Identifying and predicting MHC-binding epitopes are essential to understand the cause of diseases and develop corresponding vaccines (Lundegaard et al., 2007). Due to the size of the potential binding peptides (20 = 512 billion) for each MHC molecule (Liao and Arthur, 2011), empirical approach by biology experiments is laborious

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Prediction of MHC-peptide binding: a systematic and comprehensive overview.

T cell immune responses are driven by the recognition of peptide antigens (T cell epitopes) that are bound to major histocompatibility complex (MHC) molecules. T cell epitope immunogenicity is thus contingent on several events, including appropriate and effective processing of the peptide from its protein source, stable peptide binding to the MHC molecule, and recognition of the MHC-bound pepti...

متن کامل

T-Cell Epitope Prediction: Rescaling Can Mask Biological Variation between MHC Molecules

Theoretical methods for predicting CD8+ T-cell epitopes are an important tool in vaccine design and for enhancing our understanding of the cellular immune system. The most popular methods currently available produce binding affinity predictions across a range of MHC molecules. In comparing results between these MHC molecules, it is common practice to apply a normalization procedure known as res...

متن کامل

Toward more accurate pan-specific MHC-peptide binding prediction: a review of current methods and tools

Binding of short antigenic peptides to major histocompatibility complex (MHC) molecules is a core step in adaptive immune response. Precise identification of MHC-restricted peptides is of great significance for understanding the mechanism of immune response and promoting the discovery of immunogenic epitopes. However, due to the extremely high MHC polymorphism and huge cost of biochemical exper...

متن کامل

TEPITOPEpan: Extending TEPITOPE for Peptide Binding Prediction Covering over 700 HLA-DR Molecules

MOTIVATION Accurate identification of peptides binding to specific Major Histocompatibility Complex Class II (MHC-II) molecules is of great importance for elucidating the underlying mechanism of immune recognition, as well as for developing effective epitope-based vaccines and promising immunotherapies for many severe diseases. Due to extreme polymorphism of MHC-II alleles and the high cost of ...

متن کامل

Rapid epitope identification from complex class-II-restricted T-cell antigens.

The identification of CD4 + + T-cell epitopes is a laborious and tedious process. Various computer-assisted algorithms have been designed to assist in this process by predicting potential MHC-binding epitopes. Recently, the novel approach of using soluble MHC class II tetramers as a tool to identify T-cell epitopes has been developed successfully. We show that the T-cell epitopes identified usi...

متن کامل

Shift-Invariant Adaptive Double Threading: Learning MHC II - Peptide Binding

The major histocompatibility complex (MHC) plays important roles in the workings of the human immune system. Specificity of MHC binding to peptide fragments from cellular and pathogens' proteins has been found to correlate with disease outcome and pathogen or cancer evolution. In this paper we propose a novel approach to predicting binding configurations and energies for MHC class II molecules,...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012